Similar resources
Sequential composition for control of underactuated systems
We present a new approach to developing hybrid feedback policies for the control of systems with nonholonomic constraints. We extend the idea of sequential composition and use it to switch between controllers in a state-based manner, resulting in a globally convergent, pure feedback policy. Individual controllers in the palette are inspired by variable constraint control. We solve a vision g...
Learning Control Composition in a Complex
In this paper, reinforcement learning algorithms are applied to a foraging task, expressed as a control composition problem. The domain used is a simulated world in which a variety of creatures (agents) live and interact, reacting to stimuli and to each other. In such dynamic, uncertain environments, fast adaptation is important, and there is a need for new architectures that facilitate on-lin...
Comparing Bandwidth and Self-control Modeling on Learning a Sequential Timing Task
Modeling is a process in which an observer watches another person's behavior and adapts his or her own behavior to it as a result of that interaction. The aim of the present study was to investigate and compare the effectiveness of bandwidth modeling and self-control modeling on the performance and learning of a sequential timing task. To this end, two groups, bandwidth and self-control, were compared. The task was pres...
Learning Sequential Composition Plans Using Reduced-Dimensionality Examples
Programming by demonstration is an attractive model for allowing both experts and non-experts to command robots’ actions. In this work, we contribute an approach for learning precise reaching trajectories for robotic manipulators. We use dimensionality reduction to smooth the example trajectories and transform their representation to a space more amenable to planning. Next, regions with simple ...
Application of Sequential Reinforcement Learning to Control Dynamic Systems
The article describes the structure of a neural reinforcement learning controller, based on the approach of asynchronous dynamic programming [BBS93]. The learning controller is applied to a well-known benchmark problem, the cart-pole system. In crucial difference to previous approaches, the goal of learning is not only to avoid failure, but moreover to stabilize the cart in the middle of the trac...
Journal
Journal title: IEEE Transactions on Cybernetics
Year: 2016
ISSN: 2168-2267,2168-2275
DOI: 10.1109/tcyb.2015.2481081